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ABSTRACT  j 

Even  after  an  unambiguous  semantic  interpretation  has  been  comput^  loi  a  senierice 
in  context,  there  are  at  least  three  reasons  that  a  system  may  map  the  semantic  representation 
R  into  another  form  S. 

1.  The  terms  of  R.  while  reflecting  the  user  view,  may  require  deeper  understanding. 
e.g.  may  require  a  version  S  where  metaphors  have  been  analyzed. 


2.  Transformations  of  R  may  be  more  appropriate  for  the  underlying  application 
system.  e.g.  S  may  be  a  more  nearly  optimal  form.  These  transformations  may  not 
be  linguisticly  motivated. 


3.  Some  transformations  may  depend  on  non-^tructural  context 

Design  considerations  may  favor  factoring  the  process  into  two  stages,  for  reasons  of 
understandability  or  for  easier  transportability  of  the  components. 


This  paper  describes  the  use  of  Horn  clauses  for  the  three  classes  of  transformations 
listed  above.  The  transformations  are  part  of  a  system  that  converts  the  English  description  of 
a  software  module  into  a  formal  specification.  i.e.  an  abstract  dau  type. 


1.  INTRODUCTION 

Parsing,  semantic  interpretation,  definite  reference  resolution,  quantifier  scope  decisions, 
and  determining  the  intent  of  a  speaker /author  are  well-known  problems  of  natural  langaugc 
understanding.  Yet.  even  after  a  system  has  generated  a  semantic  represenution  R  where  such 
decisions  have  been  made,  there  may  still  be  a  need  for  further  transformation  and 
understanding  of  the  input  to  generate  a  representation  S  for  the  underlying  application  system. 
There  are  at  least  three  reasons  for  this. 

First,  consider  spatial  metaphor.  Understanding  spatial  metaphor  seems  to  require 
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computing  some  concrete  interpretation  S  for  tbe  metaphor:  however,  understanding  the 
metaphor  concretely  may  be  attempted  after  computing  a  semantic  representation  R  that 
represents  the  spatial  metaphor  formally  but  without  full  understanding.  Explaining  the 
system’s  interpretation  of  a  user  input  (e.g.  for  clarification  dialog,  allowing  the  user  to  check 
the  system’s  undersanding,  etc.)  is  likely  to  be  more  understandable  if  the  terminology  of  the 
user  is  employed.  By  having  an  intermediate  level  of  understanding  such  as  R.  and  generating 
English  output  from  it.  one  may  not  have  to  recreate  the  metaphor,  for  the  terms  in  R  use  it 
as  a  primitive. 

Second,  the  needs  of  the  underlying  application  system  may  dictate  transformations  that 
are  neither  essential  to  understanding  the  English  text  nor  linguisticly  motivated.  In  a  dau 
base  environment,  transformations  of  the  semantic  representation  may  yield  a  retrieval  request 
that  is  computationally  less  demanding  [King  80],  To  promote  portability,  EUFID  [Templeton 
83]  and  TQA  [Damerau  81]  are  interfaces  that  have  a  separate  component  for  transformations 
specifie  to  the  data  base.  In  software  specification,  mapping  of  the  semantic  representation  R 
may  yield  a  form  S  which  is  more  amenable  for  proving  theorems  about  the  specification  or 
for  rewriting  it  into  some  standard  form. 

The  following  example,  derived  from  a  definition  of  stacks  on  page  77  of  [Horowitz 
76]  illustrates  both  of  the  reasons  above. 

A  stack  is  an  ordered  list  in  which  all  insertions  and  deletions  occur  at  one  end 
called  the  top. 

A  theorem  prover  for  abstract  data  types  would  normally  assume  that  the  end  of  the  stack  in 
question  is  referred  to  by  a  notation  such  as  A[l]  if  A  is  the  name  of  the  stack,  rather  than 
understanding  the  spatial  metaphor  "one  end". 

Third,  it  may  be  convenient  to  design  the  transformation  process  in  two  phases,  where 
the  output  of  both  phases  is  a  semantic  representation.  In  our  system,  we  have  chosen  to  map 
ceruin  paraphrases  into  a  common  form  via  a  two  step  process.  The  forms  "ith  clement"  and 
"element  i"  each  generate  the  same  term  as  a  result  of  semantic  interpretation.  However,  the 
semantic  interpreter  generates  another  term  for  "element  at  position  i"  due  to  the  extra  lexical 
items  "at"  and  "position".  Obviously,  all  three  expressions  correspond  to  one  concept.  The 
mapping  component  recognizes  that  the  two  terms  generated  by  the  semantic  interpreter  arc 
paraphrases  and  maps  them  into  one  form. 

Section  2  gives  an  overview  of  the  system  as  a  whole.  Section  3  describes  the  use  of 
Horn  clauses  for  the  mapping  from  R  to  S.  Related  research  and  our  conclusions  are  presented 
in  sections  4  and  5. 


3 


2.  BRIEF  SYSTEM  OVERVIEW 

The  overall  system  contains  several  components  beside  the  mapping  component  that  is 
the  focus  of  this  paper.  The  system  takes  as  input  short  English  texts  such  as  the  data 
structure  descriptions  in  [Horowitz  76].  The  output  is  a  formal  specification  of  the  data 
structure  defined  in  Horn  clauses^.  [In  a  full  version  of  the  paper,  a  sample  text  of  about  10 
sentences  and  the  Horn  clause  output  will  appear  here.] 

First,  the  RUS  parser  [Bobrow  78],  which  includes  a  large  general-purpose  grammar 
of  English,  calls  a  semantic  component  to  incrementally  compute  the  semantic  interpretation  of 
the  phrases  being  proposed.  As  soon  as  a  phrase  is  proposed  by  the  grammar,  the  semantic 
interpreter  either  generates  a  semantic  interpretation  for  the  phrase  or  vetoes  the  parse.  The 
only  modifications  to  adapt  the  parser  to  the  application  of  abstract  data  types  were  to  add 

mathematical  notation,  so  that  phrases  such  as  “the  list  (A[l].  A  [2] .  A[N])’'  could  be 

understood.  The  semantic  component  we  developed  employs  case  frames  for  disambiguation 
and  generation  of  the  semantic  interpretation  of  *a  phrase.  However,  the  semantic  component 
does  not  make  quantifier  scope  decisions. 

Quantifier  scope  decisions,  reference  resolution,  and  conversion  from  first-order  logic 
to  Horn  clauses  is  performed  after  the  semantic  interpreter  has  completed  its  processing.  The 
knowledge  governing  these  three  tasks  is  itself  encoded  in  Horn  clauses  and  was  developed  by 
Daniel  Chester.  The  output  from  this  component  is  the  input  to  the  mapping  component, 
which  is  the  focus  of  this  paper. 

The  semantic  representation  R  of  a  single  sentence  is  therefore  a  set  of  Horn  clauses. 
In  addition,  the  model  of  context  built  in  understanding  the  text  up  to  the  current  sentence  is 
a  set  of  Horn  clauses  and  a  list  of  entities  which  could  be  referenced  in  succeeding  sentences. 
The*  mapping  component  performs  the  three  tasks  described  in  the  previous  section  to  generate 
a  set  S  of  Horn  clauses.  S  is  added  to  the  model  of  context  prior  to  processing  the  next 
input  sentence. 

The  choice  of  Horn  clauses  as  the  formal  representation  of  the  abstract  data  type  is 
based  on  the  following  motivations: 

1.  Once  a  text  has  been  understood,  the  set  of  Horn  clauses  can  be  added  to  the 
knowledge  base  (which  is  also  encoded  as  Horn  clauses).  This  offers  the  potential  of 
a  system  that  grows  in  power. 

2.  The  semantics  of  Horn  clauses,  their  use  in  theorem  proving,  and  their  execuiability 
makes  them  an  appropriate  formalism  for  defining  abstract  data  types. 

3.  A  Horn  clause  theorem  prover  [Chester  80]  allowing  free  intermixing  of  lisp  and 
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theorem  proving  is  readily  available. 


3.  MAPPING  IN  THE  SYSTEM 

The  rules  of  the  mapping  component  are  ail  encoded  as  Horn  clauses.  The  antecedent 
atomic  formulas  of  our  rules  specify  either 

1.  the  structural  change  to  be  made  in  the  collection  of  formulas  or 

2.  conditions  vhich  are  not  structural  in  nature  but  which  must  be  true  if  the  mapping 
is  to  apply. 

We  will  use  the  notation  (MAPPING-RULE  (al  ...  am)  ?x  (cl  ...  ck)  ?y)  to  mean  that  the 
atomic  formulas  al  ...  am  must  be  present  in  the  list  ?x  of  atomic  formulas;  the  list  ?x  of 
formulas  is  assumed  to  be  implicitly  conjoined.  The  variable  ?y  will  be  bound  to  the  result  of 

replacing  the  formulas  al .  am  in  ?x  with  the  formulas  cl .  ck.  There  is  a  map  between 

two  lists,  ?x  and  7y.  of  atomic  formulas  if  (MAP  7x  7y)  is  true. 

The  two  examples  given  in  the  introduction  are  detailed  next  For  expository  purposes 
the  rules  given  in  this  section  have  been  simplified. 

Consider  the  following  example; 

A  stack  is  an  ordered  list  in  which  all  insertions  and  deletions  occur  at  one  end 
called  the  top.  ADD(1.S)  adds  item  1  to  stack  S. 

In  this  environment  spatial  metaphors  tend  tp  ^  more  frozen  than  creative.  To  undersund 
"one  end",  we  assume  the  following  rules: 

1.  For  a  sequence  ?D,  we  may  map  "?E  is  an  end  of  ?D"  to  "?E  is  a  the  first 
sequence  element  of  ?D". 

2.  An  ordered  list  is  a  sequence. 

Facts  (1)  and  (2)  are  encoded  as  Horn  clauses  below. 

1.  (SEQUENCE  ?D)  (MAPPING-RULE  ((END  7E  ?D))  ?X 

((SEQUENCE-ELEMENT  7E  1  ?D))  7Y) 

=>  (MAP  7X  ?Y) 

2.  (ORDERED-LIST  7D)  =>  (SEQUENCE  7D) 

The  system  knows  how  to  map  the  notion  of  "end  of  a  sequence",  and  it  knows  that  ordered 
lists  are  sequences.  Since  the  first  sentence  is  discussing  the  end  of  an  ordered  list,  the  two 
rules  above  are  sufficient  to  map  "end”  into  the  appropriate  concrete  semantic  representation. 
The  power  and  generality  of  this  approach  is  that 

-  a  chain  of  reasoning  may  show  how  to  view  some  entity  7D  as  a  sequence  (and 
therfore  the  rules  show  how  to  interpret  "end  of  ?D"),  and 

-  other  mapping  rules  may  state  how  to  interpret  spatial  metaphors  unrelated  to  "end" 


j. 


[A  long  version  of  the  paper  can  give  rules  for  how  the  sense  "add"  being  defined  in  the 
example  above  can  be  interpreted  in  this  environment] 

The  second  example  from  the  introduction  involves  mapping  the  forms  "ith  element", 
"element  i".  and  "element  at  position  i"  into  the  same  representation.  Assume  that  the 
semantic  interpreter  generates  for  each  of  the  first  two  the  list  of  formulas  ((ELEMENT  ?X) 
(IDENTIFIED-BY  ?X  ?Y)).  The  Horn  clause  for  that  mapping  is  as  follows: 

s 

(SEQUENCE  7T)  (TOPIC  TT) 

(MAPPING-RULE  ((ELEMENT  ?X)  (IDENTIFIED-BY  ?X  ?Y))  ?W 
((SEQUENCE-ELEMENT  7X  7Y  7T))  7Z) 

=>  (MAP  7W  7Z) 

Note  that  this  rule  assumes  that  in  context  some  sequence  7T  has  been  identified  as  the  topic; 
the  rule  identifies  that  the  element  7X  is  the  7Yth  member  of  the  sequence  7T.  For  the 
phrase  "element  at  position  i".  assume  the  semantic  interpreter  generates  the  list  of  formulas 
((ELEMENT  7X)  (AT  7X  (POSITION  7P))  (IDENTIFIED-BY  7P  7Y)).  The  mapping  rule  for  it 
is  similar  to  the  one  above. 

(SEQUENCE  7T)  (TOPIC  7T) 

(MAPPING-RULE  ((ELEMENT  7X)  (AT  7X  (POSITION  7P))  (IDENTIFIED-BY  ?P  ?Y))  7W 
((SEQUENC:E-ELEMENT  7X  7Y  7T))  7Z) 

=>  (MAP  7W  7Z) 

This  second  rule  must  be  tried  before  the  prior  one. 

The  mapper  halts  when  no  more  rules  can  be  applied. 

4.  RELATED  WORK 

A  number  of  applied  A1  systems  have  been  developed  to  support  automating  software 
construction  [Balzcr  78.  Green  76,  Biermann  80.  Gomez  82}.  Of  these,  only  our  effort  has 
focussed  on  the  mapping  problem. 

Viewing  spatial  metaphors  in  terms  of  a  scale  was  proposed  in  [Hobbs  77].  Our 
model  is  somewhat  more  general  in  that  the  inference  process 

-  permits  specific  constraints  for  each  metaphor,  not  just  the  one  view  of  a  scale,  and 

-  accounts  for  other  mapping  problems  in  addition  to  spatial  metaphor. 

A  very  similar  approah  to  mapping  has  been  proposed  in  [Mark  80].  Instead  of  using 
Horn  clauses  as  the  formalism  for  mapping,  they  encode  their  rules  in  KL-ONE  [Brachman 
78].  The  concern  in  [Mark  80]  is  inferring  the  appropriate  service  to  perform  in  response  to 


a  user  request,  rather  than  demonstrating  means  of  interpreting  spatial  metaphors  or  of  finding 
contextually  dependent  paraphrases. 

5.  CONCLUSIONS 

There  are  several  reasons  why  one  may  have  a  second  transduction  phase  even  after  a 
semantic  representation  for  an  utterance  has  been  computed.  The  advantage  of  using  Horn 
clauses  (or  any  other  deduction  mechanism)  in  this  mapping  phase  is  the  ability  to  include 
nonstructural  conditions.  This  means  that  the  mapping  rules  may  be  based  on  reasoning  about 
the  context 

There  are  three  areas  for  future  work; 

-  generating  mapping  rules  based  on  additional  texts, 

-  investigating  use  of  the  mapping  component  in  reference  resolution,  and 

-  developing  an  indexing  technique  to  run  the  mapper  in  a  forward  chaining  mode. 
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